Compositional complexity of DNA sequence models

نویسندگان

  • P. Bernaola-Galván
  • P. Carpena
  • R. Román-Roldán
چکیده

Recently, we proposed a new measure of complexity for symbolic sequences (Sequence Compositional Complexity, SCC) based on the entropic segmentation of a sequence into compositionally homogeneous domains. Such segmentation is carried out by means of a conceptually simple, computationally efficient heuristic algorithm. SCC is now applied to the sequences generated by several stochastic models which describe the statistical properties of DNA, in particular the observed long-range fractal correlations. This approach allows us to test the capability of the different models in describing the complex compositional heterogeneity found in DNA sequences. Moreover, SCC detects clear differences where conventional standard methods fail.  1999 Elsevier Science B.V. All rights reserved.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SEGMENT: identifying compositional domains in DNA sequences

MOTIVATION DNA sequences are formed by patches or domains of different nucleotide composition. In a few simple sequences, domains can simply be identified by eye; however, most DNA sequences show a complex compositional heterogeneity (fractal structure), which cannot be properly detected by current methods. Recently, a computationally efficient segmentation method to analyse such nonstationary ...

متن کامل

کیانیت در هاله‌های دگرگونی

Different models are discussed to interpret the presence of kyanite as a stable phase in contact aureoles especially in the andalusite-, sillimanite-bearing aureoles. In such aureoles the polymorphic sequence kyanite → andalusite → sillimanite can be explained in terms of an essentially isobaric path during which kyanite initially crystallises from the breakdown of pyrophyllite or more likely m...

متن کامل

Application of Task Complexity Along +/- single Task Dimension and its Effect on Fluency in Writing Performance of Iranian EFL Learners

In the present study, two different models of task complexity; namely, limited attentional capacity model and cognition hypothesis were examined. To this end, the manipulation of cognitive task complexity along +/- single task dimension on Iranian EFL learners’ production in terms of fluency was explored. Based on the results of the writing test of TOFEL (2004), 48 learners were selected as the...

متن کامل

Application of Task Complexity Along +/- single Task Dimension and its Effect on Fluency in Writing Performance of Iranian EFL Learners

In the present study, two different models of task complexity; namely, limited attentional capacity model and cognition hypothesis were examined. To this end, the manipulation of cognitive task complexity along +/- single task dimension on Iranian EFL learners’ production in terms of fluency was explored. Based on the results of the writing test of TOFEL (2004), 48 learners were selected as the...

متن کامل

The Measure of Compositional Heterogeneity in DNA Sequences Is Related to Measures of Complexity

DNA sequences store the complete genetic information of a biological organism. Understanding the “genetic language” in DNA sequences is the ultimate goal of the Human Genome Project, which will have a profound impact on biology, medicine, and human society [1]. In one sense, the genetic language written in DNA sequences is simpler than the English language because it is composed of only four le...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999